Abstract

In previous papers, we expressed the Entanglement of Formation in terms of
Conditional Mutual Information (CMI). In this brief paper, we express the
Entanglement of Distillation in terms of CMI.



1 Introduction 



Entanglement of Formation ($E_F$) and Entanglement of Distillation ($E_D$) were
invented by Bennett et al. in Ref. [?] and satellite papers. In a series of previous
papers [?], we showed how to express $E_F$ in terms of Conditional Mutual Information
(CMI), but we said nothing about $E_D$. In this brief letter, we will show how to
express $E_D$ in terms of CMI. Recently, other researchers have expressed some of their
entanglement ideas in terms of unconditional mutual information. See, for example, Ref. [?].

Two reasons why CMI is useful for quantifying entanglement are the following.
First, entanglement is an "exclusively quantum" effect. CMI satisfies this
requirement. It vanishes in the classical regime, but not in the quantum regime, for a
fiducial experiment. Second, entanglement is associated with a correlation between
two events $\underline{a}$ and $\underline{b}$. But there must be something to distinguish
entanglement correlations from classical correlations. CMI satisfies this requirement too.
It measures more than just the correlation of $\underline{a}$ and $\underline{b}$. Those two
events are assumed to have a common ancestor event (or cause, or antecedent) in their past,
call it $\underline{\alpha}$, and we condition on that common ancestor. (See Fig. 1.) For
example, in Bohm's version of the EPR experiment, $\underline{\alpha}$ might correspond to
the event of a spin-zero particle breaking up into two spin-half particles with opposite spins.

We will try to make this paper as self-contained as we can for such a short
document. If the reader has any questions concerning notation or definitions, we
refer the reader to Ref. [?], a much longer tutorial paper that uses the same notation
as this paper.

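We will represent random variables by underlined letters. $S_{\underline{a}}$ will be the
set of all possible values that $\underline{a}$ can assume, and $N_{\underline{a}}$ will be
the number of elements in $S_{\underline{a}}$. $S_{\underline{a},\underline{b}}$ will
represent the Cartesian product of sets $S_{\underline{a}}$ and $S_{\underline{b}}$. In the
quantum case, $\mathcal{H}_{\underline{a}}$ will represent a Hilbert space of dimension
$N_{\underline{a}}$. $\mathcal{H}_{\underline{a},\underline{b}}$ will represent the tensor
product of $\mathcal{H}_{\underline{a}}$ and $\mathcal{H}_{\underline{b}}$. Red indices
should be summed over (e.g. $a_{\color{red}i} b_{\color{red}i} = \sum_i a_i b_i$).
$pd(S_{\underline{a}})$ will denote the set of all probability distributions on
$S_{\underline{a}}$; that is, functions $P(a) \geq 0$ such that
$\sum_{a \in S_{\underline{a}}} P(a) = 1$. $dm(\mathcal{H}_{\underline{a}})$ will denote the
set of all density matrices acting on $\mathcal{H}_{\underline{a}}$.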

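As usual [?], for any three random variables $\underline{a}, \underline{b}, \underline{\lambda}$,
we define the mutual information (MI) by

$$H(\underline{a}:\underline{b}) = H(\underline{a}) + H(\underline{b}) - H(\underline{a},\underline{b}) \;, \qquad (1)$$

and the conditional mutual information (CMI) by

$$\begin{aligned}
H(\underline{a}:\underline{b}|\underline{\lambda}) &= H(\underline{a}|\underline{\lambda}) + H(\underline{b}|\underline{\lambda}) - H(\underline{a},\underline{b}|\underline{\lambda}) \\
&= -H(\underline{\lambda}) + H(\underline{a},\underline{\lambda}) + H(\underline{b},\underline{\lambda}) - H(\underline{a},\underline{b},\underline{\lambda}) \;. \qquad (2)
\end{aligned}$$

The second line follows from $H(\underline{a}|\underline{\lambda}) = H(\underline{a},\underline{\lambda}) - H(\underline{\lambda})$ and its analogues for $\underline{b}$ and $(\underline{a},\underline{b})$.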

Since $H(\underline{a}|\underline{\lambda}) \leq H(\underline{a})$, one might be tempted to
assume that also $H(\underline{a}:\underline{b}|\underline{\lambda}) \leq H(\underline{a}:\underline{b})$,
but this is not generally true. One can construct examples for which CMI is greater
or smaller than MI, a fact well known since the early days of Classical Information
Theory [?].
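The claim that CMI can be either larger or smaller than MI can be checked on two textbook distributions over bits. The following is a minimal sketch, not taken from the original paper: the function names `entropy`, `mutual_info` and `cond_mutual_info`, and the array layout `p[a, b, lam]`, are assumptions made for illustration. Entropies are in nats, and the CMI is computed from the first line of Eq.(2) expanded with $H(\underline{a}|\underline{\lambda}) = H(\underline{a},\underline{\lambda}) - H(\underline{\lambda})$.

```python
import numpy as np

def entropy(p):
    """Shannon entropy (in nats) of a probability array of any shape."""
    p = np.asarray(p, dtype=float).ravel()
    p = p[p > 0]                      # convention: 0 ln 0 = 0
    return float(-np.sum(p * np.log(p)))

def mutual_info(p_abl):
    """H(a:b) of Eq.(1), from a joint array indexed as p[a, b, lam]."""
    p_ab = p_abl.sum(axis=2)
    return entropy(p_ab.sum(axis=1)) + entropy(p_ab.sum(axis=0)) - entropy(p_ab)

def cond_mutual_info(p_abl):
    """H(a:b|lam) = H(a,lam) + H(b,lam) - H(a,b,lam) - H(lam)."""
    return (entropy(p_abl.sum(axis=1)) + entropy(p_abl.sum(axis=0))
            - entropy(p_abl) - entropy(p_abl.sum(axis=(0, 1))))

# Case 1: a and b are independent fair bits and lam = a XOR b.
xor = np.zeros((2, 2, 2))
for a in range(2):
    for b in range(2):
        xor[a, b, a ^ b] = 0.25

# Case 2: a single fair bit copied three times, a = b = lam.
copy = np.zeros((2, 2, 2))
copy[0, 0, 0] = copy[1, 1, 1] = 0.5

print("XOR :", mutual_info(xor), cond_mutual_info(xor))
print("copy:", mutual_info(copy), cond_mutual_info(copy))
```

In the XOR case MI is zero while CMI equals $\ln 2$; in the copy case MI equals $\ln 2$ while CMI is zero. So conditioning can either create or destroy correlation.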

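One can define analogous quantities for Quantum Physics. Suppose
$\rho_{\underline{a},\underline{b},\underline{\lambda}} \in dm(\mathcal{H}_{\underline{a},\underline{b},\underline{\lambda}})$,
with partial traces $\rho_{\underline{a}} \in dm(\mathcal{H}_{\underline{a}})$, etc. Then we define

$$S(\underline{a}:\underline{b}) = S(\rho_{\underline{a}}) + S(\rho_{\underline{b}}) - S(\rho_{\underline{a},\underline{b}}) \;, \qquad (3)$$

and

$$S(\underline{a}:\underline{b}|\underline{\lambda}) = -S(\rho_{\underline{\lambda}}) + S(\rho_{\underline{a},\underline{\lambda}}) + S(\rho_{\underline{b},\underline{\lambda}}) - S(\rho_{\underline{a},\underline{b},\underline{\lambda}}) \;. \qquad (4)$$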

2 Entanglement of Formation 

Before racing off at full speed, let's warm up with a brief review of the CMI definition
of $E_F$.

Consider the Classical Bayesian Net shown in Fig. 1. It represents a probability
distribution of the form:

$$P(a, b, \alpha) = P(a|\alpha) P(b|\alpha) P(\alpha) \;. \qquad (5)$$

Figure 1: Classical Bayesian Net that motivates the definition of $E_F$.

One can easily check that for this probability distribution,
$H(\underline{a}:\underline{b}|\underline{\alpha})$ is identically zero.
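The vanishing of the CMI for a distribution with a common-ancestor structure can be confirmed numerically. The sketch below is an illustration only: the sizes `Na`, `Nb`, `Nalpha`, the random seed, and the helper names are arbitrary assumptions, not part of the original paper. It draws random factors $P(a|\alpha)$, $P(b|\alpha)$, $P(\alpha)$, multiplies them together, and evaluates $H(\underline{a}:\underline{b}|\underline{\alpha})$ in nats.

```python
import numpy as np

def entropy(p):
    """Shannon entropy (in nats) of a probability array of any shape."""
    p = np.asarray(p, dtype=float).ravel()
    p = p[p > 0]
    return float(-np.sum(p * np.log(p)))

def cond_mutual_info(p):
    """H(a:b|alpha) for a joint array indexed as p[a, b, alpha]."""
    return (entropy(p.sum(axis=1)) + entropy(p.sum(axis=0))
            - entropy(p) - entropy(p.sum(axis=(0, 1))))

def mutual_info(p):
    """H(a:b) of the (a, b) marginal of p[a, b, alpha]."""
    p_ab = p.sum(axis=2)
    return entropy(p_ab.sum(axis=1)) + entropy(p_ab.sum(axis=0)) - entropy(p_ab)

rng = np.random.default_rng(0)
Na, Nb, Nalpha = 3, 4, 5                                  # arbitrary sizes

P_alpha = rng.dirichlet(np.ones(Nalpha))                  # P(alpha)
P_a_given = rng.dirichlet(np.ones(Na), size=Nalpha).T     # P_a_given[a, alpha]
P_b_given = rng.dirichlet(np.ones(Nb), size=Nalpha).T     # P_b_given[b, alpha]

# Joint distribution P(a, b, alpha) = P(a|alpha) P(b|alpha) P(alpha)
P = np.einsum('ak,bk,k->abk', P_a_given, P_b_given, P_alpha)

cmi = cond_mutual_info(P)   # zero up to rounding error
mi = mutual_info(P)         # unconditional MI of the same distribution
print(cmi, mi)
```

The unconditional MI of the same distribution is generally not zero, which is why the conditioning on the common ancestor matters.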

In the classical case, we define $E_F$ by

$$E_F(P_{\underline{a},\underline{b}}) = \min_{P_{\underline{a},\underline{b},\underline{\alpha}} \in K} H(\underline{a}:\underline{b}|\underline{\alpha}) \;, \qquad (6)$$

where $K$ is the set of all probability distributions $P_{\underline{a},\underline{b},\underline{\alpha}}$
with a fixed marginal $P_{\underline{a},\underline{b}}$. Thus, $E_F$ is a function of
$P_{\underline{a},\underline{b}}$. If $K$ contains a $P_{\underline{a},\underline{b},\underline{\alpha}}$
of the form given by Eq.(5), then $E_F = 0$. This is always true if $K$ is defined to contain
all probability distributions with arbitrary positive values of $N_{\underline{\alpha}}$. But it
may not be true if $K$ contains only probability distributions with a fixed
$N_{\underline{\alpha}}$ value. The fact that the right hand side of Eq.(6) vanishes in the
classical case (if $K$ includes all $N_{\underline{\alpha}}$ values) is an important
motivation for defining $E_F$ this way. We want a measure of entanglement that is
exclusively quantum.

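In the quantum case, suppose $\{w_\alpha : \alpha = 1, \ldots, N_{\underline{\alpha}}\}$ is a
probability distribution for $\underline{\alpha}$, and
$\{|\alpha\rangle : \alpha = 1, \ldots, N_{\underline{\alpha}}\}$ is an orthonormal basis for
$\mathcal{H}_{\underline{\alpha}}$. For all $\alpha$, suppose
$\rho^\alpha_{\underline{a}} \in dm(\mathcal{H}_{\underline{a}})$, and
$\rho^\alpha_{\underline{b}} \in dm(\mathcal{H}_{\underline{b}})$. Consider a "separable"
density matrix $\rho_{\underline{a},\underline{b},\underline{\alpha}}$ of the form

$$\rho_{\underline{a},\underline{b},\underline{\alpha}} = \sum_\alpha \rho^\alpha_{\underline{a}} \, \rho^\alpha_{\underline{b}} \, w_\alpha |\alpha\rangle\langle\alpha| \;. \qquad (7)$$

One can easily check that for this density matrix,
$S(\underline{a}:\underline{b}|\underline{\alpha}) = 0$.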
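The same check can be carried out numerically for a density matrix that is a classical mixture, over an orthonormal ancestor basis, of product states on the two subsystems. The sketch below is an illustration under stated assumptions: the helpers `von_neumann`, `partial_trace`, `quantum_cmi` and `random_density` are hypothetical names introduced here, the dimensions are arbitrary, and the quantum CMI is evaluated as $S(\rho_{a,\alpha}) + S(\rho_{b,\alpha}) - S(\rho_{a,b,\alpha}) - S(\rho_{\alpha})$ in nats.

```python
import numpy as np

def von_neumann(rho):
    """Von Neumann entropy S(rho) = -tr(rho ln rho), in nats."""
    w = np.linalg.eigvalsh(rho)
    w = w[w > 1e-12]                  # drop numerically zero eigenvalues
    return float(-np.sum(w * np.log(w)))

def partial_trace(rho, dims, keep):
    """Reduced density matrix on the subsystems whose indices are in `keep`."""
    n = len(dims)
    t = rho.reshape(list(dims) + list(dims))   # row axes first, then column axes
    # Trace out unwanted subsystems from the highest index down, so that the
    # positions of the remaining lower-index axes are not disturbed.
    for i in sorted(set(range(n)) - set(keep), reverse=True):
        t = np.trace(t, axis1=i, axis2=i + t.ndim // 2)
    d = int(np.prod([dims[i] for i in keep]))
    return t.reshape(d, d)

def quantum_cmi(rho, dims):
    """S(a:b|alpha) for subsystems ordered as (a, b, alpha)."""
    S = lambda keep: von_neumann(partial_trace(rho, dims, keep))
    return S([0, 2]) + S([1, 2]) - von_neumann(rho) - S([2])

def random_density(d, rng):
    """A random full-rank density matrix of dimension d."""
    g = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
    rho = g @ g.conj().T
    return rho / np.trace(rho).real

rng = np.random.default_rng(4)
dims = (2, 3, 4)                               # (N_a, N_b, N_alpha), arbitrary
w = rng.dirichlet(np.ones(dims[2]))            # weights w_alpha

# rho = sum_alpha w_alpha * rho_a^alpha (x) rho_b^alpha (x) |alpha><alpha|
rho = np.zeros((int(np.prod(dims)),) * 2, dtype=complex)
for alpha in range(dims[2]):
    proj = np.zeros((dims[2], dims[2]))
    proj[alpha, alpha] = 1.0
    rho_a = random_density(dims[0], rng)
    rho_b = random_density(dims[1], rng)
    rho += w[alpha] * np.kron(np.kron(rho_a, rho_b), proj)

cmi = quantum_cmi(rho, dims)   # zero up to rounding error
print(cmi)
```

The result vanishes because, in each block labelled by $\alpha$, the state on the two subsystems is a product state, so the entropies of the blocks add.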
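In the quantum case, we define $E_F$ by

$$E_F(\rho_{\underline{a},\underline{b}}) = \min_{\rho_{\underline{a},\underline{b},\underline{\alpha}} \in K} S(\underline{a}:\underline{b}|\underline{\alpha}) \;, \qquad (8)$$

where $K$ equals the set
$K_0 = \{\rho_{\underline{a},\underline{b},\underline{\alpha}} \in dm(\mathcal{H}_{\underline{a},\underline{b},\underline{\alpha}}) : \text{arbitrary } N_{\underline{\alpha}}, \text{ fixed marginal } \rho_{\underline{a},\underline{b}}\}$.
Thus, $E_F$ is a function of $\rho_{\underline{a},\underline{b}}$. If $K$ contains a
$\rho_{\underline{a},\underline{b},\underline{\alpha}}$ of the form given by Eq.(7), then
$E_F$ is zero. The quantum $E_F$ can be nonzero even if $K$ contains all density matrices
with arbitrary $N_{\underline{\alpha}}$ values.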

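In Eq.(8), we could set $K = K_2$, where $K_2$ is the subset of $K_0$ which restricts
$\rho_{\underline{a},\underline{b},\underline{\alpha}}$ to be of the form

$$\rho_{\underline{a},\underline{b},\underline{\alpha}} = \sum_\alpha |\psi^\alpha_{\underline{a},\underline{b}}\rangle\langle\psi^\alpha_{\underline{a},\underline{b}}| \, w_\alpha |\alpha\rangle\langle\alpha| \;, \qquad (9)$$

where $|\psi^\alpha_{\underline{a},\underline{b}}\rangle \in \mathcal{H}_{\underline{a},\underline{b}}$.
One can show that Eq.(8) with $K = K_2$ is identical (up to a factor of 2) to the definition
of $E_F$ originally given by Bennett et al. in Ref. [?]. Other possible $K$ choices come to
mind. For example, one could set $K$ equal to $K_1$, where $K_1$ is that subset of $K_0$
which restricts $\rho_{\underline{a},\underline{b},\underline{\alpha}}$ to be of the form

$$\rho_{\underline{a},\underline{b},\underline{\alpha}} = \sum_\alpha \rho^\alpha_{\underline{a},\underline{b}} \, w_\alpha |\alpha\rangle\langle\alpha| \;, \qquad (10)$$

where $\rho^\alpha_{\underline{a},\underline{b}} \in dm(\mathcal{H}_{\underline{a},\underline{b}})$
need not be pure. $K_0, K_1, K_2$ represent different degrees of information about how
$\rho_{\underline{a},\underline{b},\underline{\alpha}}$ was created. $K_0$ represents total
ignorance.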

3 Classical Distillation 

In this section, we will define a classical $E_D$. In the next section, we will find a
quantum counterpart for it.

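Consider the Classical Bayesian Net of Fig. 2. The arrow from
$(\underline{a}, \underline{a}')$ to $(\underline{b}, \underline{b}')$ allows what is often
referred to as "classical communication from Alice to Bob". Let
$\underline{x} = (\underline{A}, \underline{B})$,
$\underline{x}' = (\underline{A}', \underline{B}')$. Let
$N_{\underline{a}} = N_{\underline{a}'} = N_{\underline{A}} = N_{\underline{A}'}$ and
$N_{\underline{b}} = N_{\underline{b}'} = N_{\underline{B}} = N_{\underline{B}'}$.

Figure 2: Classical Bayesian Net that motivates the definition of $E_D$.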

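The net of Fig. 2 satisfies:

$$P(a, b, a', b') = \sum_{x, x'} P(a, a'|A, A') \, P(b, b'|B, B', a, a') \, P(x) P(x') \;, \qquad (11)$$

where

$$P(x) = \sum_\alpha P(A|\alpha) P(B|\alpha) P(\alpha) \qquad (12a)$$

and

$$P(x') = \sum_{\alpha'} P(A'|\alpha') P(B'|\alpha') P(\alpha') \;. \qquad (12b)$$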

We wish to consider only those experiments in which $\underline{a}'$ and $\underline{b}'$
are both fixed at a known value, call it 0 for definiteness. For such experiments, one
considers:

$$P(a, b|a' = b' = 0) = \frac{P(a, b, a' = 0, b' = 0)}{P(a' = 0, b' = 0)} \;. \qquad (13)$$

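Henceforth we will use $\Gamma$ as a short-hand for the string "$a' = 0, b' = 0$". We will
also use $U$ to denote $P_{\underline{a},\underline{a}'|\underline{A},\underline{A}'}$ and $V$
to denote $P_{\underline{b},\underline{b}'|\underline{B},\underline{B}',\underline{a},\underline{a}'}$.

In the classical case, we define $E_D$ by

$$E_D(P_{\underline{x}}, P_{\underline{x}'}) = \max_{U,V} \; \min_{P_{\underline{a},\underline{b},\underline{\lambda}|\Gamma} \in K} H(\underline{a}:\underline{b}|\underline{\lambda}, \Gamma) \;, \qquad (14)$$

where $K$ is the set of all probability distributions
$P_{\underline{a},\underline{b},\underline{\lambda}|\Gamma}$ with a fixed marginal
$P_{\underline{a},\underline{b}|\Gamma}$ that satisfies Eq.(11).
$P_{\underline{a},\underline{b}|\Gamma}$ depends on $P_{\underline{x}}, P_{\underline{x}'}, U, V$.
Since we maximize over $U, V$, $E_D$ is a function of $P_{\underline{x}}$ and
$P_{\underline{x}'}$.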

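Next we will show that the net of Fig. 2, without the classical communication arrow,
satisfies:

$$E_D(P_{\underline{x}}, P_{\underline{x}'}) \leq E_F(P_{\underline{x}}) + E_F(P_{\underline{x}'}) \;. \qquad (15)$$

Suppose we could show that

$$H(\underline{a}:\underline{b}|\underline{\alpha},\underline{\alpha}',\Gamma) \leq H((\underline{A},\underline{A}'):(\underline{B},\underline{B}')|\underline{\alpha},\underline{\alpha}') \;. \qquad (16)$$

After taking limits on the left hand side, this gives

$$E_D \leq H(\underline{A},\underline{A}':\underline{B},\underline{B}'|\underline{\alpha},\underline{\alpha}') \;. \qquad (17)$$

Note that by the independence of the primed and unprimed variables

$$H(\underline{A},\underline{A}':\underline{B},\underline{B}'|\underline{\alpha},\underline{\alpha}') = H(\underline{A}:\underline{B}|\underline{\alpha}) + H(\underline{A}':\underline{B}'|\underline{\alpha}') \;. \qquad (18)$$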
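The additivity of CMI over independent primed and unprimed systems can be verified numerically. The sketch below is an illustration only: the array sizes, the seed, and the helper names are assumptions, and the two triples $(A, B, \alpha)$ and $(A', B', \alpha')$ are drawn as arbitrary joint distributions, since additivity requires only their mutual independence.

```python
import numpy as np

def entropy(p):
    """Shannon entropy (in nats) of a probability array of any shape."""
    p = np.asarray(p, dtype=float).ravel()
    p = p[p > 0]
    return float(-np.sum(p * np.log(p)))

def cond_mutual_info(p):
    """H(u:v|w) for a joint array indexed as p[u, v, w]."""
    return (entropy(p.sum(axis=1)) + entropy(p.sum(axis=0))
            - entropy(p) - entropy(p.sum(axis=(0, 1))))

rng = np.random.default_rng(2)
NA, NB, Nal = 2, 3, 2                                   # arbitrary sizes

p = rng.dirichlet(np.ones(NA * NB * Nal)).reshape(NA, NB, Nal)   # P(A, B, alpha)
q = rng.dirichlet(np.ones(NA * NB * Nal)).reshape(NA, NB, Nal)   # P(A', B', alpha')

# Independent joint with axes (A, B, alpha, A', B', alpha')
joint = np.einsum('abk,cdl->abkcdl', p, q)

# Regroup the axes as ((A, A'), (B, B'), (alpha, alpha'))
grouped = joint.transpose(0, 3, 1, 4, 2, 5).reshape(NA * NA, NB * NB, Nal * Nal)

lhs = cond_mutual_info(grouped)                  # H(A,A' : B,B' | alpha,alpha')
rhs = cond_mutual_info(p) + cond_mutual_info(q)  # H(A:B|alpha) + H(A':B'|alpha')
print(lhs, rhs)
```

The two numbers agree up to rounding error, because every entropy of a product distribution splits into a primed part plus an unprimed part.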

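Eqs.(17) and (18) imply Eq.(15). So let us concentrate on establishing Eq.(16). Events
$\underline{A}, \underline{A}', \underline{B}, \underline{B}'$ all occur before $\Gamma$ so
they are independent of $\Gamma$. Therefore, we can write:

$$H(\underline{A},\underline{A}':\underline{B},\underline{B}'|\underline{\alpha},\underline{\alpha}') = H(\underline{A},\underline{A}':\underline{B},\underline{B}'|\underline{\alpha},\underline{\alpha}',\Gamma) \;. \qquad (19)$$

Because of Eq.(19), Eq.(16) is equivalent to:

$$H(\underline{a}:\underline{b}|\underline{\alpha},\underline{\alpha}',\Gamma) \leq H(\underline{A},\underline{A}':\underline{B},\underline{B}'|\underline{\alpha},\underline{\alpha}',\Gamma) \;. \qquad (20)$$

Eq.(20) follows easily from the following Lemma, which is proven in Appendix A.

Lemma: The net of Fig. 3 satisfies

$$H(\underline{a}:\underline{b}|\underline{\lambda}) \leq H(\underline{x}:\underline{y}|\underline{\lambda}) \;. \qquad (21)$$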

4 Quantum Distillation 

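In this section, we will give a quantum counterpart of the classical $E_D$ defined in the
previous section.

As in the classical case, let $\underline{x} = (\underline{A}, \underline{B})$,
$\underline{x}' = (\underline{A}', \underline{B}')$. Let
$N_{\underline{a}} = N_{\underline{a}'} = N_{\underline{A}} = N_{\underline{A}'}$ and
$N_{\underline{b}} = N_{\underline{b}'} = N_{\underline{B}} = N_{\underline{B}'}$. Suppose
$\rho_{\underline{x}} \in dm(\mathcal{H}_{\underline{x}})$ and
$\rho_{\underline{x}'} \in dm(\mathcal{H}_{\underline{x}'})$ are given. Suppose
$U_{\underline{a},\underline{a}'|\underline{A},\underline{A}'}$ is a unitary transformation
mapping $\mathcal{H}_{\underline{A},\underline{A}'}$ onto
$\mathcal{H}_{\underline{a},\underline{a}'}$:

$$U^\dagger_{\underline{A},\underline{A}'|\underline{a},\underline{a}'} \, U_{\underline{a},\underline{a}'|\underline{A},\underline{A}'} = 1 \;. \qquad (22)$$

Figure 3: Classical Bayesian Net that obeys the Data Processing Inequality Eq.(21).

Likewise, suppose that for each $a$, $V^a_{\underline{b},\underline{b}'|\underline{B},\underline{B}'}$
is a unitary transformation mapping $\mathcal{H}_{\underline{B},\underline{B}'}$ onto
$\mathcal{H}_{\underline{b},\underline{b}'}$:

$$V^{a\dagger}_{\underline{B},\underline{B}'|\underline{b},\underline{b}'} \, V^a_{\underline{b},\underline{b}'|\underline{B},\underline{B}'} = 1 \;. \qquad (23)$$

Define the following projector on $\mathcal{H}_{\underline{a}}$:

$$\pi^a_{\underline{a}} = |a\rangle_{\underline{a}} \langle a|_{\underline{a}} \;. \qquad (24)$$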
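Now consider the following density matrix

$$\rho_{\underline{a},\underline{b}|\Gamma} = \frac{1}{P(\Gamma)} \sum_a \langle 0|_{\underline{a}'} \langle 0|_{\underline{b}'} \; \pi^a_{\underline{a}} \, U_{\underline{a},\underline{a}'|\underline{A},\underline{A}'} \, V^a_{\underline{b},\underline{b}'|\underline{B},\underline{B}'} \; \rho_{\underline{x}} \rho_{\underline{x}'} \; V^{a\dagger}_{\underline{B},\underline{B}'|\underline{b},\underline{b}'} \, U^\dagger_{\underline{A},\underline{A}'|\underline{a},\underline{a}'} \, \pi^a_{\underline{a}} \; |0\rangle_{\underline{a}'} |0\rangle_{\underline{b}'} \;, \qquad (25)$$

where $P(\Gamma)$ is defined so that
$\mathrm{tr}_{\underline{a},\underline{b}}(\rho_{\underline{a},\underline{b}|\Gamma}) = 1$. The
previous equation can also be expressed in index notation as:

$$\begin{aligned}
\langle a, b_1|\rho_{\underline{a},\underline{b}|\Gamma}|a, b_2\rangle = \frac{1}{P(\Gamma)} \sum \; & \langle a, a' = 0|U_{\underline{a},\underline{a}'|\underline{A},\underline{A}'}|A_1, A_1'\rangle \qquad & (26) \\
& \langle b_1, b' = 0|V^a_{\underline{b},\underline{b}'|\underline{B},\underline{B}'}|B_1, B_1'\rangle \qquad & (27) \\
& \langle A_1, B_1|\rho_{\underline{x}}|A_2, B_2\rangle \qquad & (28) \\
& \langle A_1', B_1'|\rho_{\underline{x}'}|A_2', B_2'\rangle \qquad & (29) \\
& \langle B_2, B_2'|V^{a\dagger}_{\underline{B},\underline{B}'|\underline{b},\underline{b}'}|b_2, b' = 0\rangle \qquad & (30) \\
& \langle A_2, A_2'|U^\dagger_{\underline{A},\underline{A}'|\underline{a},\underline{a}'}|a, a' = 0\rangle \;, \qquad & (31)
\end{aligned}$$

where the sum runs over the repeated indices $A_1, A_1', B_1, B_1', A_2, A_2', B_2, B_2'$.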



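Finally, we define $E_D$ by

$$E_D(\rho_{\underline{x}}, \rho_{\underline{x}'}) = \max_{U,V} \; \min_{\rho_{\underline{a},\underline{b},\underline{\lambda}|\Gamma} \in K} S(\underline{a}:\underline{b}|\underline{\lambda}, \Gamma) \;, \qquad (32)$$

where $K$ contains all density matrices
$\rho_{\underline{a},\underline{b},\underline{\lambda}|\Gamma}$ with a fixed marginal
$\rho_{\underline{a},\underline{b}|\Gamma}$ that satisfies Eq.(25).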



A Appendix: Some Data Processing Inequalities 

In this appendix, we will prove two well known Data Processing Inequalities. 

If $P, Q \in pd(S_{\underline{x}})$, then the relative entropy (or Kullback-Leibler distance)
between $P$ and $Q$ is

$$D(P//Q) = \sum_x P(x) \ln[P(x)/Q(x)] \;. \qquad (33)$$



Lemma A.1 (Data Processing Inequality for Relative Entropy, see Ref. [?]) If
$P, Q \in pd(S_{\underline{x}})$ and
$T = \{T(y|x) : y \in S_{\underline{y}}, x \in S_{\underline{x}}\}$ is a matrix of
non-negative numbers such that $\sum_y T(y|x) = 1$, then

$$D(P//Q) \geq D(TP//TQ) \;, \qquad (34)$$

where $TP$ should be understood as the matrix product of the matrix $T$ times the column
vector $P$, i.e. $(TP)(y) = \sum_x T(y|x) P(x)$.
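A quick numerical sanity check of this lemma is sketched below. It is an illustration only, not a proof: the sizes `Nx`, `Ny`, the seed, and the helper name `rel_entropy` are assumptions made here. The transition matrix is stored as `T[y, x]` so that each column sums to one and the transformed distribution is the ordinary matrix-vector product.

```python
import numpy as np

def rel_entropy(p, q):
    """D(P//Q) = sum_x P(x) ln[P(x)/Q(x)], in nats.

    Assumes Q(x) > 0 wherever P(x) > 0.
    """
    mask = p > 0
    return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))

rng = np.random.default_rng(1)
Nx, Ny = 5, 3                                   # arbitrary alphabet sizes

P = rng.dirichlet(np.ones(Nx))
Q = rng.dirichlet(np.ones(Nx))
T = rng.dirichlet(np.ones(Ny), size=Nx).T       # T[y, x]; every column sums to 1

lhs = rel_entropy(P, Q)             # D(P//Q)
rhs = rel_entropy(T @ P, T @ Q)     # D(TP//TQ)
print(lhs, rhs)
```

For any choice of `P`, `Q` and column-stochastic `T`, the first printed number should be at least as large as the second.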



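proof:

One has

$$\begin{aligned}
D(P//Q) &= \sum_x P(x) \ln\left(\frac{P(x)}{Q(x)}\right) \qquad & (35) \\
&= \sum_y \sum_x T(y|x) P(x) \ln\left(\frac{T(y|x) P(x)}{T(y|x) Q(x)}\right) \qquad & (36) \\
&\geq \sum_y TP(y) \ln\left(\frac{TP(y)}{TQ(y)}\right) = D(TP//TQ) \;. \qquad & (37)
\end{aligned}$$

Line (37) follows from an application of the Log-Sum Inequality. QED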
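For completeness, the Log-Sum Inequality used in the final step of the proof above can be written out explicitly. It holds for any non-negative numbers $a_i$, $b_i$; the labels $a_i$, $b_i$ are introduced here only for illustration and do not appear in the original text.

```latex
% Log-Sum Inequality, for non-negative a_i and b_i:
\sum_i a_i \ln\frac{a_i}{b_i} \;\geq\; \Big(\sum_i a_i\Big) \ln\frac{\sum_i a_i}{\sum_i b_i} \;.

% Applied for each fixed y, with a_x = T(y|x) P(x) and b_x = T(y|x) Q(x):
\sum_x T(y|x) P(x) \ln\frac{T(y|x) P(x)}{T(y|x) Q(x)}
  \;\geq\; TP(y) \ln\frac{TP(y)}{TQ(y)} \;.
```

Summing the second inequality over $y$ gives the final inequality of the proof.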

Lemma A.2 (Data Processing Inequality for CMI) The net of Fig. 3 satisfies

$$H(\underline{x}:\underline{y}|\underline{\lambda}) \geq H(\underline{a}:\underline{b}|\underline{\lambda}) \;. \qquad (38)$$

proof:

The net of Fig. 3 represents a probability distribution of the form:

$$P(a, b, x, y, \lambda) = P(a|x) P(b|y) P(x|\lambda) P(y|\lambda) P(\lambda) \;. \qquad (39)$$

One can easily show that such a probability distribution satisfies:

$$P(a|\lambda) = \sum_x P(a|x) P(x|\lambda) \;, \qquad (40a)$$

$$P(b|\lambda) = \sum_y P(b|y) P(y|\lambda) \;, \qquad (40b)$$

$$P(a, b|\lambda) = P(a|\lambda) P(b|\lambda) \;, \qquad (41a)$$

$$P(x, y|\lambda) = P(x|\lambda) P(y|\lambda) \;. \qquad (41b)$$

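For any two random variables $\underline{\lambda}, \underline{x}$, let
$Q^\lambda_{\underline{x}}$ be shorthand for $P_{\underline{x}|\underline{\lambda} = \lambda}$.
In other words, $Q^\lambda_{\underline{x}}(x) = P_{\underline{x}|\underline{\lambda}}(x|\lambda)$
for all $x, \lambda$. The two CMI we are dealing with can be rewritten in terms of relative
entropy as follows:

$$H(\underline{x}:\underline{y}|\underline{\lambda}) = \sum_\lambda P(\lambda) \, D(Q^\lambda_{\underline{x},\underline{y}} // Q^\lambda_{\underline{x}} Q^\lambda_{\underline{y}}) \;, \qquad (42)$$

and

$$H(\underline{a}:\underline{b}|\underline{\lambda}) = \sum_\lambda P(\lambda) \, D(Q^\lambda_{\underline{a},\underline{b}} // Q^\lambda_{\underline{a}} Q^\lambda_{\underline{b}}) \;. \qquad (43)$$

Thus, if we can show that

$$D(Q^\lambda_{\underline{x},\underline{y}} // Q^\lambda_{\underline{x}} Q^\lambda_{\underline{y}}) \geq D(Q^\lambda_{\underline{a},\underline{b}} // Q^\lambda_{\underline{a}} Q^\lambda_{\underline{b}}) \;, \qquad (44)$$

then the present Lemma will be proven. The last inequality will follow from Lemma A.1 if we
can find a transition probability matrix $T(a, b|x, y)$ such that

$$P(a, b|\lambda) = \sum_{x,y} T(a, b|x, y) P(x, y|\lambda) \;, \qquad (45a)$$

and

$$P(a|\lambda) P(b|\lambda) = \sum_{x,y} T(a, b|x, y) P(x|\lambda) P(y|\lambda) \;. \qquad (45b)$$

Eqs.(45) follow easily from Eqs.(40) and (41), with $T$ given by:

$$T(a, b|x, y) = P(a|x) P(b|y) \;. \qquad (46)$$

QED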
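The inequality of Lemma A.2 can also be probed numerically with the product transition matrix $T(a, b|x, y) = P(a|x) P(b|y)$. The sketch below rests on an assumption that goes beyond the net as written in the proof: it lets $\underline{x}$ and $\underline{y}$ be correlated given $\underline{\lambda}$, so that both sides of the inequality are generally nonzero. The helper names, array sizes and seed are also assumptions made for illustration.

```python
import numpy as np

def entropy(p):
    """Shannon entropy (in nats) of a probability array of any shape."""
    p = np.asarray(p, dtype=float).ravel()
    p = p[p > 0]
    return float(-np.sum(p * np.log(p)))

def cond_mutual_info(p):
    """H(u:v|lam) for a joint array indexed as p[u, v, lam]."""
    return (entropy(p.sum(axis=1)) + entropy(p.sum(axis=0))
            - entropy(p) - entropy(p.sum(axis=(0, 1))))

rng = np.random.default_rng(3)
Nx, Ny, Nl, Na, Nb = 3, 3, 2, 2, 4               # arbitrary sizes

# Assumption: a generic joint P(x, y, lam), with x and y correlated given lam.
p_xyl = rng.dirichlet(np.ones(Nx * Ny * Nl)).reshape(Nx, Ny, Nl)

P_a_x = rng.dirichlet(np.ones(Na), size=Nx).T    # P_a_x[a, x] = P(a|x)
P_b_y = rng.dirichlet(np.ones(Nb), size=Ny).T    # P_b_y[b, y] = P(b|y)

# P(a, b, lam) = sum_{x,y} P(a|x) P(b|y) P(x, y, lam)
p_abl = np.einsum('ax,by,xyl->abl', P_a_x, P_b_y, p_xyl)

cmi_xy = cond_mutual_info(p_xyl)    # H(x:y|lam)
cmi_ab = cond_mutual_info(p_abl)    # H(a:b|lam)
print(cmi_xy, cmi_ab)
```

Local processing of $\underline{x}$ into $\underline{a}$ and of $\underline{y}$ into $\underline{b}$ should never increase the conditional correlation, so the first printed number should be at least as large as the second.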